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BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates generally to translation methods and apparatus, and 
more particularly to translation methods and apparatus over a network. 

Description of the Related Art - 

In most computer systems involving a central host processor and 
numerous distributed access devices such as video display terminals, information 
is transferred between the host and each access device via a screen display formed 
as an integral part of the access device. The screen serves the two-fold purpose of 
displaying input information provided by a user as well as displaying user- 
readable output information, generated by host processing, to the user. The input 
information is generally provided by the user via entries on a keyboard also 
formed as an integral part of each access device. Input or output information is 
typically composed of an arrangement of system-provided words or phrases 
followed by user or system-supplied data fields displayed in a predetermined 
pattern on the screen. 
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In conventional systems, an application executing on the host utilizes only 
a limited number of screen patterns or formats so that only a standard set of screen 
images corresponding to the formats may be called into view by the user for input 
or invoked by the host processing for output. The definition of each screen format 

5 is typically deeply embedded in the source code for the application. There is 
essentially no flexibility provided to the user to allow for the creation of 
customized formats and, correspondingly, their screen images. In order to expand 
the user base of a previously developed software application system, particularly 
to allow foreign affiliates of the system developer full utilization of system 

10 capabilities, major modifications to the source code of the system have 

conventionally been required, such as by rewriting significant portions of the code 
implementing input/output (I/O) interface functions. 

The foremost modification in the above special situation is that of 
translating the descriptive words or phrases from the original language (e.g., 

1 5 English) to a different language (e.g., Spanish). If there are affiliates from 

numerous foreign countries then, besides the effort of rewriting the source code, 
multiple copies of modified source code require storing, tracking and updating. 
Such a task becomes unwieldy, burdensome and costly. Thus, for example, if a 
source code module uses or produces user-viewable information, then there must 

20 be a different copy of the module for each language executable by that software. 
Besides the actual system copies of the code, support software is required to 
inform the system developer of the status of the multiple copies. Moreover, 
additional storage devices are needed to store all the additional versions of the 
software. For a large scale system involving millions of lines of code and 

25 thousands of modules, the storage requirements may become enormous. 

In addition to the problem of direct language translation, there are also 
problems as how to treat the data supplied to the data fields. It is usually required 
that certain data be converted, such as by converting from non-metric to metric, 
and other data be reordered, such as month/day/year versus day/month/year which 

30 is the convention of some foreign affiliates. 

A translation environment which serves to buffer the host system to each 
of the access devices is disclosed in U.S. Patent No. 4,870,61 0. Broadly 
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speaking, the translation environment includes an autonomous processor 
interposed between the host system and each access device. Information 
transmitted in either direction between the host and access device is diverted to the 
processor for intermediate processing. The diverted information contains detailed 
5 character data either appearing on the input request screen originated at the access 
device or on the output response screen destined for the access device, depending 
upon the direction of original information transmission. The character data is of 
two types, namely, system-supplied field identifiers and user-provided data entries 
associated with the identifiers. Identifiers are expressed in a first user language 

10 (e.g., English). 

In order to offer access to the host system by a user of a second language 
(e.g., Spanish), the screen displays and, most particularly, the identifiers are first 
translated to the second language via a format create process. The output of this 
create process is a translation file which stores the mapping relationship between 

1 5 the first language screen and its second language counterpart. The translation file 
is invoked by a translation execution process whenever the second language user 
accesses the host system. The contents of this file are used to translate from the 
second-to-first language upon a host request and from the first-to-second language 
upon a host response. A feature of this arrangement is that both the format create 

20 process and the translation execution process operate in the translation 

environment which is transparent to the host system. With the translation 
environment, the user may customize screen displays to maximize system 
utilization. 

U.S. Patent No. 5,966,685 is directed to a system of parallel discussion 
25 groups operated in conjunction with a message collection/posting software 
program, data filter software program, and a machine translation software 
program. A structure and process is created to enable discussion group users, of 
different languages, to communicate with one another. An automatic batch 
process is utilized that executes at a remote site. No human intervention is 
30 required for the pre-processing, translation, or post-processing functions. 

Additionally, users simply specify a language preference to realize the benefits 
and advantages of the present invention. 
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A number of discussion groups run in "parallel"; one group for each 
language being used in the discussion groups. The individual discussion groups all 
contain the same information, in the same order; the only difference being that 
each parallel discussion group is written in a different language. Once a user logs 
5 onto a particular parallel discussion group he or she may then choose his or her 
language preference. If the user's language preference is set to French, the French 
version of the discussion group will be accessed. Messages posted to a discussion 
group will be periodically collected, translated to the other languages, and then 
posted to those respective target language discussion groups. The collection and 

10 posting of the messages will be accomplished by the Message Collection/Posting 
Software The new messages which are collected on a periodic basis are sent to a 
commercially available Machine Translation (MT) software for translation. 
Messages are batch processed automatically at the network site and without 
human intervention. The translation takes place at a remote site so user actions are 

1 5 minimized. 

Before the input text is actually submitted to the MT software, the input 
text is passed through a filter software program which preprocesses the data before 
it is submitted to the MT software. The filter identifies and marks strings which 
are best left untranslated by the MT software, such as personal names, company 

20 product names, file and path names, commands, samples of source code, and the 
like. By marking these strings, the filter notifies the MT software to leave those 
strings untranslated. These strings are then linked to a preceding "hookword". 
Hookwords are automatically inserted then deleted in post-processing and are 
contained in dictionaries with a part-of-speech and other grammatical features to 

25 effect rearrangement of the word in the target language. Once the translation 
process is complete, the translations are collected and posted, by the Message 
Collection/Posting Software, to the target language discussion groups at the same 
location within the message structure as the original version of the message. The 
pre-processing, translation, and post-processing functions are all performed 

30 automatically in accordance with a batch process that executes on a periodic basis 
at the network site. 
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U.S. Patent No. 5,960,382 discloses a method and apparatus for translating 
a native-language message into a corresponding foreign-language message. 
Translation of an initially-unknown message is effected using native-language and 
foreign- language prototype messages that are independent of message variables, 
5 whereby a prototype message represents all messages of an individual type. An 
individual message is identified to belong to a particular type by using the native- 
language prototype message, and an equivalent foreign- language message is then 
generated by inserting variable values from the individual message into the 
foreign-language prototype message that represents the particular message type. 

10 The native-language message, which includes a value of a variable, is 

matched against a plurality of native-language prototype messages to identify a 
corresponding native-language prototype message, which includes the variable. 
The plurality of native-language prototype messages preferably represent all 
native- language messages that require translation. The identification of the 

15 prototype native- language message is used to obtain (e.g., retrieve) a 

corresponding foreign-language prototype message, which also includes the 
variable. 

The value of the variable, obtained from the native-language message that 
is being translated, is then substituted for the variable in the obtained foreign- 

20 language prototype message to yield a foreign-language message which 

corresponds to (i.e., which is a translation of) the native- language message. If the 
native language message includes values of a plurality of variables, the identified 
native-language prototype message and the corresponding foreign- language 
prototype message each includes the plurality of variables. The plurality of the 

25 variables have a first ordering in the identified native-language prototype message 
and a second ordering in the corresponding foreign-language prototype message, 
and the two orderings are generally different. 

The substitution step then involves using the first ordering and the second 
ordering to determine a placement of the values of the variables into the obtained 

30 foreign-language prototype message. Preferably, the matching step involves the 
use of a multi-tiered multi-node tree constructed from the native-language 
prototype messages, and matching strings (e.g., words and numerals) which make 
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up the native-language message in their order against the nodes of corresponding 
tiers in the tree to reach a node which represents the last string in the message and 
contains the message identifier of the corresponding prototype message. This 
identifier is then used to obtain the corresponding foreign-language prototype 
5 message, which has the same identifier. 

There is a need for a language translation method and apparatus for 
networks that addresses issues associated with two-bit characters. There is a 
further need a language translation method and apparatus for networks that 
provides for user feedback. There is a further need for a language translation 
1 0 method and apparatus for networks that limits subject matter and language usage 
domains. Yet there is another need for a language translation method and 
apparatus for networks that takes advantage of application-specific characteristic 
repetitions in language. 

15 STTMMARY OF THE INVENTION 

Accordingly, an object of the present invention is to methods and 

apparatus that improve the quality and usability of the translations. 

Another object of the present invention is to provide language translation 

methods and apparatus suitable for the internet and other distributed networks. 
20 Yet another object of the present invention is to provide language 

translation methods and apparatus that provide user feedback. 

A further object of the present invention is to provide language translation 

methods and apparatus for the internet that limits subject matter and language 

usage domains. 

25 Another object of the present invention is to provide language translation 

methods and apparatus for networks that takes advantage of application-specific 
characteristic repetitions in language. 

Another object of the present invention is to provide language translation 
methods and apparatus for the internet that provide user feedback relative for 

30 determining whether inputs are translated correctly. 
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Another object of the present invention is to provide language translation 
methods and apparatus that actively educate users on how to use translation 
engines. 

Another object of the present invention is to provide language translation 
methods and apparatus with user defined dictionaries. 

Another object of the present invention is to provide language translation 
methods and apparatus that provide user direction and customization of 
translation. 

Another object of the present invention is to provide language translation 
methods and apparatus that permit modification in order to better handle the 
characteristic language of different specific applications. 

Another object of the present invention is to provide language translation 
methods and apparatus that provide a static translation cache of frequently 
encountered phrases, 

Another object of the present invention is to provide language translation 
methods and apparatus that provide a key form for storing cached phrases which 
removes extraneous information 

Another object of the present invention is to provide language translation 
methods and apparatus that provides for the most flexible and productive 
application of a phrase cache. 

Another object of the present invention is to provide language translation 
methods and apparatus that provide typing shortcuts for languages and allow free 
deletion of selected clauses. 

Another object of the present invention is to provide language translation 
methods and apparatus that provide simultaneous display of original and translated 
content or messages from a network on a single screen without disrupting the look 
and feel. 

Another object of the present invention is to provide language translation 
methods and apparatus that provides for personalization. 

Another object of the present invention is to provide language translation 
methods and apparatus that is real-time and does not cause delay for the user. 
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Another object of the present invention is to provide language translation 
methods and apparatus that indues multiple translation engines in a single system 
with a uniform API. 

Another object of the present invention is to provide language translation 
5 methods and apparatus that provides a uniform API to numerous applications. 

These and other objects of the present invention are achieved in a method 
for electronically translating text. An electronic language translator is provided. 
Source language text is received as an input to the electronic language translator. 
The source language text is translated at the electronic language translator at the 
10 time of submission into one or more target language texts. A user is then provided 
with an option of viewing one or more of the target language texts with or without 
the source language texts. 

In another embodiment of the present invention, a method for 
electronically translating text provides an electronic language translator system 
1 5 that includes an electronic language translator and at least a first and a second 

dictionary. The electronic language translator references the first dictionary and 
then the second dictionary in a process of translating source language text into one 
or more target language texts. The dictionaries are maintained in an application or 
customer hierarchy. Source language text is received at an input of the electronic 
20 language translator. The source language text is translated at the electronic 

language translator into one or more target language texts. An output is produced 
that includes the one or more target language texts. 

In another embodiment of the present invention, a method for electronic 
language translation provides one or more translation modules receiving 
25 source language text from an input interface. One or more input interfaces 

and one or more output interfaces are provided. A generic data format is 
included that is independent of the translation modules, input interfaces, and 
output interfaces. The input source language text is converted from the format 
for a specific input interface to the generic format. A determination is made 
30 of the one or more translation modules that provides an optimal translation. 

The text is routed to the module that provides the optimal translation. Text is 
converted from the generic data format to a specific input format of a 
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translation module. The specific output format from a translation module is 
converted to the generic data format. Data is converted from the generic data 
format into an output format suitable for an output interface. 
In another embodiment of the present invention, a method for 
5 electronically translating text provides an electronic language translator 

coupled to an interface. Source language text is translated at the electronic 
language translator into one or more target language texts. Translated text is 
output in one or more target languages to an output interface. Controls are 
provided at an interface coupled to the electronic language translator to 
1 0 dynamically select which of the one or more target languages are output at the 

interface. The interface representation of text is varied in the one or more 
target languages to allow a user to differentiate between the displayed 
languages. Controls are provided at an interface to create differentiation 
between one or more target languages. 
1 5 In another embodiment of the present invention, a method for 

electronically translating text provides an electronic language translator 
coupled to an interface. The source language text is translated at the 
electronic language translator into one or more target language texts. The 
translated output is displayed to the original user. Feedback is provided to the 
20 original user about the quality of the translation. 

In another embodiment of the present invention, a method for 
electronically translating text provides an electronic language translator coupled to 
an interface. The source language text is translated at the electronic language 
translator into one or more target language texts. At least two candidate 
25 translations are produced for each source language text. The translated candidates 
are compared to one or more language models trained on data similar in style and 
subject matter to the text being translated. The best quality translation is selected 
for the input from the multiple translation candidates according to which best 
matches the one or more language models. A desired best quality translation is 
30 then displayed. 

In another embodiment of the present invention, a system for 
electronically translating text includes an electronic language translator that 
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receives source language text input and produces translated target language text. 
An interface is coupled to the electronic language translator and configured to 
provide a user with an option of viewing one or more target language texts with or 
without source language text. 

5 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 describes a screenshot of a chat application of the present 
invention. 

Figures 2-3 describe embodiments of the real-time multilingual 
10 communication application of the present invention. 

Figure 4 is a high level overview illustrating operation of the System of 
the present invention. 

Figure 5 is a high level overview illustrating operation of the System of 
the present invention. 

15 Figure 6 is a high level overview illustrating operation of the System of 

the present invention. 

Figure 7 illustrates data flow between the client wireless device, the 
wireless network provider, the data provider, and the translation services of the 
present invention. 

20 Figure 8 illustrates a generic peer-to-peer data exchange. 

Figure 9 illustrates a generic client-server data exchange, where the 
translation service of the present invention acts as the intermediary in the data 
exchange 

Figures 10(a)- 10(b) are directed to an auction tool embodiment of the 
25 present invention which combines features of the multi-lingual search engine and 
the browsing tool of the present invention 

Figure 1 1 shows the dynamic translation cache of the present invention. 
Figure 12 illustrates the framework of the present invention. 
Figure 13 illustrates the five steps performed at the translation 
30 Figure 14 illustrates the procedures that each step of transation undergoes 

of the present invention. 



Attorney Docket No. 24 1 76-703 1 0 

H:\private\wpdocs\pd\amikai\703app42.doc 



Figure 15(a) illustrates the full linguistic processing occurring in the chat 
application of the present invention. 

Figure 15(b) is similar to Figure 15(a) with the same data path, for non- 
interactive applications of the present invention. 
5 Figure 16 illustrates input aids incorporated into the System of the present 

invention. 

Figure 17 illustrates a common phrase table with the static translation 
cache of the present invention. 

Figure 1 8 illustrates language post-processing of the present invention. 
10 Figure 19 illustrates text post-processing of the present invention. 

Figure 20 illustrates text pre-processing of the present invention. 

Figure 21 is a continuation of Figure 8 illustrating language pre- 
processing of the present invention. 

Figure 22 illustrates the translation engines with dictionaries of different 
1 5 types of the present invention. 

Figure 23 shows a topic specific dictionary of the present invention. 

Figure 24 illustrates the four types of feedback produced and utilized by 
the System of the present invention. 

Figure 25 shows the levels of feedback which are incorporated in the 
20 translation system of the present invention. 

Figure 26 illustrates the way the System of the present invention educates 
the user about the MT engine, as well as about his or her own language. 

Figure 27 describes the browsing tool of the present invention on a high 
level as a three-step process: 
25 Figure 28 is a high-level blowup of step 2 from Figure 27. 

Figure 29 provides more detail of step 2 from Figure 27 for page retrieval 
and processing. 

Figure 30 is a blowup of page retrieval as represented in Figure 29. 
Figure 3 1 is a blowup of step 1 from Figure 30 describing how parameters 
30 are added to a URL before querying the source site. 

Figure 32 is a blowup of step 2 from Figure 30. 

Figure 33 is directed to page rewriting of the present invention. 
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Figure 34 illustrates page rewriting as a two-pass process of the present 
invention. 

Figure 35 is a graphical illustration of the page rewriting process 
described in Figure 34. 
5 Figure 36 is a blowup of pass 1 from Figure 37. 

Figure 37-39 give examples of how certain elements are handled in pass 1. 
Figure 40 describes instances in the page where the browsing tool rewrites 
URLs in the page of the present invention. 

Figure 41 illustrates how text is translated as part of the browsing tool of 
1 0 the present invention. 

Figure 42 is a blowup of the final stage from Figure 30 for handling 
incoming cookies of the present invention. 

Figure 43 shows the Help button of the present invention. 
Figure 44 illustrates the restatement window of the present invention. 
1 5 Figure 45 shows an example of the interactive tutorial entry of the present 

invention. 

Figure 46 is a blow-up of the different types of the tutorials of the present 
invention. 

Figure 47 shows an example of warning lights of the present invention. 
20 Figure 48 shows how users' expectations and knowledge about the System 

are influenced through actual use of the System of the present invention. 

Figure 49 illustrates the tutorial daemon of the present invention. 

Figure 50 depicts the input length meter of the present invention. 

Figure 51 is an example of shortcuts of the present invention. 
25 Figure 52 shows example of emoticons of the present invention. 

Figure 53 is an example of the translator of the present invention. 

Figure 54 illustrates a number of personalization features of the present 
invention. 

Figure 55 illustrates the splash page of the present invention. 
30 Figure 56 illustrates the plurality of different features in the chat 

application of the present invention. 
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Figure 57 illustrates the different features in the chat room of the present 
invention. 

Figure 58 illustrates the Help process of the present invention. 
Figure 59 illustrates the "current member's box" of the present invention. 
Figure 60 illustrates switching language zones of the present invention. 
Figure 61 is an overview of the browsing tool of the present invention. 
Figure 62 lists some of the browsing tool features of the present invention. 
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DETAILED DESCRIPTION 



In one embodiment of the present invention, a system ("System") for 
electronically translating text includes an electronic language translator that 
receives source language text input and produces translated target language text. 
5 An interface is coupled to the electronic language translator and configured to 
provide a user with an option of viewing one or more target language texts with or 
without source language text. The electronic language translator translates the 
source language text to at least one target language at the time of submission of 
the source language text. An output interface outputs the target language text 
1 0 from the electronic language translator. The output interface can vary an interface 
representation of text in the one or more target languages. 

The electronic language translator can include at least first and second 
dictionaries. The electronic language translator references the first dictionary and 
then the second dictionary in a process of translating source language text into one 

15 or more target language texts. The dictionaries are maintained in an application or 
customer hierarchy. A generic data format can be included that is independent of 
the translation engines, input interfaces and output interfaces. 

In one embodiment, a conversion module converts the input source 
language text from the format for a specific input interface to a generic format. A 

20 routing module then determines which translator provides an optimal translation 
and then routes the text to that translator. A conversion module converts text from 
the generic data format to a specific input format . A conversion module converts 
a specific output format from the translation engine to the generic data format. A 
conversion module can be included to convert data from the generic data format 

25 into an output format suitable for an output interface. 

The System of the present invention has a variety of different applications 
including but not limited to translation of text, real time translated chat, website 
content, e-mail, instant messaging, multi-lingual auctions and marketplaces, and 
the like. 

30 The present invention allows multiple people to engage in an online 

translated text conversation. Users can define their input and view languages and 
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Chat applications of the present invnetion translates input sentences from one user 
to the appropriate output languages defined by each of the other users. A 
screenshot of a chat application of the present invention is illustrated in Figure 1. 
In one embodiment, the present invention invention is for casual chat between 
5 users on a portal or community site, intra-company communication on a corporate 
intranet, business-oriented chat on a business-to-business exchange, and real-time 
customer support solutions, among other uses. 

Figures 2-5 illustrate one emboidment of a real-time multilingual 
communication methods and apparatus of the present invention. Figure 2 

1 0 illustrates how different users use the method and apparatus of the present 

invention. Illustrated is a two-person interaction model. In this model, two people 
communicate exclusively with each other, sending messages back and forth. Each 
message is sent to the chat server, translated (if the message is textual), and 
relayed to the other user. The second diagram illustrates multiple-user 

1 5 communication where multiple people message each other in one room. In this 
model, every message that is sent by any one chat client is captured by the chat 
server, translated (if the message is textual), and rebroadcast to every chat client 
in the chat room. 

Figure 3 illustrates the several types of messages that travel within a chat 
20 room. One is a plain text message which is translated instantaneously into any of 
the chat room's supported languages. The next type is iconic. These are 
significant because when dealing with universal language there is a need to 
transmit messages that are understood universally. Another type of message is 
meta-transactional whose sole purpose is to facilitate the entire process of 
25 communication. One example of a meta-transactional message is the "Help?" 
message, which one user may send to a second user alerting him that she did not 
understand his message, and requesting that he restate it in a way that may be 
translated more effectively and thereby more easily understood. 

Figure 4 is a high level overview illustrating operation of the System of 
30 the present invention. A user uses their browser to access the web server of the 
present invention. The web server delivers a page which contains an applet. The 
applet appears. From that point forward communication between the user and the 
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System occurs exclusively through that applet. The user inputs a message in the 
applet which is sent to the chat server. When the message is textual, the chat 
server sends the message to the translation System, which translates the message 
and sends it back to the chat server. At this point, the message contains versions 
5 of the user-entered text in all the supported languages of the chat room. The 
server then retransmits the message out to all users in the chat room. 

Figure 5 illustrates the type of feedback that occurs. Throughout the 
process a feedback loop runs continuously. As messages are received by the 
translation System, there is interaction between 1) the database for storage of 

1 0 information of the messages, as well as 2) with the machine translation engines. 
The database stores our static translation cache, which contains many text phrases 
pre-translated across multiple languages. These translations are performed by 
humans and are thereby guaranteed to have perfect quality. In addition, the cache 
modifies itself over time by reacting to patterns it observes in the types of 

15 messages it receives. This results in higher translation quality. Text elements that 
are not handled by the cache, are sent to the translation engines for translation. 
Each box in the figure represents a single machine translation language direction, 
e.g. English to French. As a result, the translation System utilizes multiple 
translation engine components. Different providers can be used for different 

20 language pairs. Some engines can support multiple directions. 

The present invetnion also provides a translated web browsing tool that 
provides machine translations of website content. One example is translation of 
all text on a website into a language defined by a user. The present invention also 
provides methods and apparatus that translate text embedded within graphics on a 

25 website. The browsing tool of the present invention can be useed both on an 
actual website and as a downloadable tool that plugs in directly to the user's 
browser. The actual tool itself includes a toolbar that resides on the top or bottom 
of the user's browser screen and gives the user functional control over the 
language of translation, the URL of the website the user wishes to access, as well 

30 as a number of other features such as a way to submit the current site for human 
translation. A screenshot of the browsing tool of the present invention is shown in 
Figure 6. 
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The browsing tool of the present invention is primarily a tool that provides 
individual users with access to Internet content that they would not be able to 
access with the tool. Examples of applications of the browsing tool include but 
are not limited to education, entertainment, research and the like. 
5 Figure 7 illustrates data flow between the client wireless device, the 

wireless network provider, the data provider, and the translation services of the 
present invention. 

Generally, the client wireless device is any personal mobile electronic 
device with a display/output apparatus, an input apparatus, and data transmission 

1 0 capability which is designed to serve as a mobile terminal for Internet and other 
network transactions. Examples include but are not limited to: cellular phones 
with data transmission and display capabilities, personal handyphone systems 
(PHS), personal digital assistants (PDAs), palmtop computers, and 
Internet/network capable appliances and devices. The wireless network provider is 

15 the data transmission infrastructure which allows the client devices to exchange 
data with each other and with any other devices accessible over the network. The 
data provider is any device which supplies either static or dynamic data to the 
client device over the data transmission infrastructure. The present invention acts 
as an intermediary in this data exchange, translating the data from one language to 

20 another as it passes from client device to data provider, from data provider to 
client device, or from client device to client device. 

The wireless translation applications of the present invention are 
substantially equivalent to the internet translation applications. Some of the 
differences between the two include, the data is encoded in WML (wireless 

25 markup language), HDML, or some other standard for wireless data exchange, 
rather than HTML, the target end-user device is a data-capable cellular phone, 
personal computer, or other wireless data terminal rather than a desktop or laptop 
computer and the data is transmitted over the data network of the cellular service 
provider instead of/in addition to being transmitted over networks such as the 

30 Internet. 
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In a typical untranslated peer-to-peer transaction, a client wireless device 
may send a data transmission over the wireless data transmission infrastructure 
and network, where it is routed to another client wireless device. 

Figure 8 illustrates a generic peer-to-peer data exchange where the 
5 translation service of the present invention acts as the intermediary in the data 

exchange. The present invention is integrated with the wireless data infrastructure 
and network. As data is sent from a client wireless device to another client 
wireless device over the wireless network server, that data is passed to the present 
invention which translates/processes/transforms that data and returns it to the 

10 wireless network server to be routed to the destination wireless device. Examples 
of data transmissions which fit this peer-to-peer model include SMS (short 
messaging system) messages, alphanumeric pager messages and the like. 

In a typical untranslated client/server transaction, a client wireless device 
may send a request for data over the wireless data transmission infrastructure and 

1 5 network, where it is routed to a data provider server. That server replies with the 
requested data, which is returned to the client wireless device over the wireless 
data transmission network. Figure 9 illustrates a generic client-server data 
exchange, where the translation service of the present invention acts as the 
intermediary in the data exchange. A client wireless device formulates a request 

20 for data from a particular server; this request is then forwarded to the present 

invention. The translation service accesses the wireless data and services specified 
in the client request and translates/processes/transforms that information before 
returning it to the requesting end-user. Examples of this client/server model 
include WAP data browsing, server push data, and the like. 

25 Further, the methods and apparatus of the present invention provide can 

provide draft quality translations of text emails. Users simply type their email in 
their own language and it is translated by into the target language of the person the 
email is being sent to. The translation can also take place on the side of the 
receiver, when someone receives an email in a language he or she may not be 

30 familiar with. 

Instant messaging is designed as a communication platform for people 
who are accessing networks, including but not limited to the Internet, 
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concurrently. When someone receives a message while offline, the message is 
stored for them to view the next time they log in to the Internet. Translated instant 
messaging can be used in corporate communication, customer service, student 
interaction, and any other situation requiring instantaneous communication across 
5 a language barrier. 

The System of the present invention can include a multilingual search 
engine that allows someone who speaks one language to search for information on 
the Internet or on a specific site that is in a different language. A query can be 
entered in one language and the search engine of the System translates the query 

10 into the target language before searching for matching information. In another 
embodiment, the System also designs a mechanism that can resolve ambiguous 
search queries by asking the user for more input in potentially ambiguous 
situations. The multilingual search engine can be used to search for information 
on the Internet in general, or to search for a product or piece of information on a 

15 specific website or domain of information. Examples of potential uses include but 
are not limited to searching for a certain type of business outside of a country on 
an informational website, searching for a certain type of product on a foreign 
ecommerce site, or searching the entire Internet for websites related to a certain 
topic that are written in a language not native to the user. 

20 Methods and apparatus of the present invention can tie in directly with 

online auction and marketplace sites. This solution allows users of the 
marketplace or auction to post messages or product descriptions in such a way that 
they are easily viewed and translated into a number of different languages. Form 
fields and drop-down menus can be used that limit the number of choices a user 

25 has when describing a certain product. This allows for storage of the posted 
information in a format that can be easily transferred to any language. 

Figures 10(a) and 10(b) are directed to an auction tool embodiment of the 
present invention which combines features of the multi-lingual search engine and 
the browsing tool A user can enter a query in Language A, even though the site 

3 0 itself is in Language B . 

For example, a user could enter in Japanese "osara" which means plate. 
That gets translated into the proper language; in the case of an English- language 
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site, "osara" is translated into "plate." The regular query is run on the auction 
site's database. From the auction site's side the interaction is the same as a 
regular, monolingual search. They haven't changed their processing at all. The 
pages that are returned are translated by the System of the present invention and 
the links are shown along with the translated version of the links. When a user 
clicks on a link to see the actual auction, the auction page comes up and is 
completely translated. 

The auction site does not have to change its database lookup or change the 
pages they push. All of the translation management, including preparing text for 
translation, executing translations, and displaying translated versions of pages is 
handled by the System, and is completely transparent to the auction site. 

Figure 1 1 shows the dynamic translation cache, which records recently 
translated sentences and is dynamically updated with each translation call. When 
a translation is requested, the dynamic translation cache is consulted first to see if 
the requested sentence was translated recently. If so, the recorded translation can 
be returned immediately, saving time and processing cycles on the translation 
engine. This is significant for many applications of the translation System, but in 
particular for the auction tool. In auction searches, users will often enter 
successive queries that are very similar to each other, varying the keywords only 
slightly. This causes the same auctions to be returned repeatedly. The present 
invention capitalizes on this repetitive behavior with a dynamic cache that keeps a 
record of all the recent translations. This is done in a manner similar to the 
common phrase table, and similarly takes advantage of the characteristic 
repetitions in the application's language use. 

In another embodiment, the present invention integrates translated text 
and translated search into a product, allowing users of the marketplace or auction 
to search for goods and information in multiple languages. The results can then be 
displayed in their own language. 

Along with the specific features needed to multilingualize an auction or 
market site, the present invention also provides users with custom dictionaries and 
common phrase lists tailored to their particular applications. This is especially 
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effective where translations relate to a limited topic area, such as for a specialty 
goods auctions site. 

Users can have direct access to the System, web-sites and interfaces of the 
present invention. Additionally, the System of the present invention permits users 
to act as hosts. 

Referring now to Figure 12, the framework of the present invention is 
illustrated and includes an interface, distribution and translation layers. A user 
uses the interface layer to construct an initial input and view the translated output 
as the user engages in multilingual communication. Each application can have a 
unique interface that maximizes the effectiveness of translated communication in a 
particular domain. In one embodiment, the methods and apparatus of the present 
invention utilize Java-based input and output interfaces. However, the present 
invention can also integrate with user interfaces of existing business applications, 
making it easy to empower existing applications with the capability of 
multilingual communication. 

In one embodiment, the present invention provides outpt interfaces that 
differ among applications, can be primarily Java-based, and handle output in all 
supported language pairs. The output interface displays languages even to users 
who do not have an operating system that is native to the language of the output. 
To do this, the System can utilize all Java Unicode character strings. Depending 
upon the user's system, it may request that the user allow a short procedure that 
installs appropriate fonts and writes to certain configuration files on the user's 
system. This installation procedure enables the user's system to display fonts that 
are not native to their operating system. 

After the input is received the interface layer forwards the request to the 
distribution layer for further processing. The distribution layer serves as a conduit 
between the user Interface and the translation layer. Specifically, the distribution 
layer provides language pair distribution, load balancing and is a common 
interface to the translation layer. For language pair distribution, the distribution 
layer ensures that the translation queries are passed on to the correct translation 
engines of the System based on the language-pair of the translation request. The 
Distribution Layer utilizes a load-balancing system that manages the load of each 
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instance of the translation engines. For every language pair, the System of the 
present invention can create multiple instances of the translation engine. The 
Distribution Layer ensures that the queries are distributed efficiently among the 
different instances of the engine. As a common interface to the translation layer 
5 the interface layer will vary depending on the software application. Applications 
with widely differing user interfaces can all utilize the translation layer in the 
same manner 

Translation is performed at the translation layer which can include the five 
steps illustrated in Figure 13. Procedures that each step undergoes are more fully 

10 explained in Figure 14. 

Figures 15(a) and 15(b) illustrates the full linguistic processing that occurs 
in the chat application specifically. As the user inputs something a series of input 
aids are available to help the user type in inputs. For example, typing in Japanese 
is very slow. There is tension between the desire to type and input as fast as 

1 5 possible especially in the chat application, while at the same time making the input 
language as clean and proper as possible for the translation engine. The method 
and apparatus of the present invention minimizes the problems of monolingual 
input, which is usually too poor quality to translate well. This is achieved by 
providing a series of tools that draws a balance between the two extremes of fast 

20 input and strictly grammatical language. Further detail on input tools is given in 
Figure 16. 

A static translation cache, also called a common phrase table, is provided. 
This includes phrases that are frequently repeated in chat applications (or whatever 
the specific application of the machine translation is); the phrases are stored with 

25 perfect translations. Items in the cache go directly to post-processing without 
going through the translation or other engines. Further detail on the static 
translation cache is given in Figure 17. Finally, there is some post-processing. 
Further detail on the post-processing is given in Figures 18 and 19. 

In step 1 the text input is converted into a state that can be translated. 

30 Text inputs differ among applications. It is important for every input be distilled 
down to a form that can be synthesized by subsequent steps which are application- 
independent. Therefore, step 1 is application-dependent. In step 1, the following 

Attorney Docket No. 24 1 76-703 22 
H:\private\wpdocs\pd\amikai\703app42.doc 



actions occur: unnecessary whitespace within the text are removed, improper 
capitalization are removed, which are later restored in step 5, excess punctuation is 
removed, again restored in step 5, the input is spell-checked, the input is grammer 
checked. Certain contractions are removed from the input. In the case of 
5 translated HTML browsing for the translated browse feature of the present 

invention, the pre-processing operates differently. There, the pre-processing step 
handles the task of parsing the HTML, preparing the appropriate text for 
translation, and then reformatting the resulting translations while preserving the 
form of the original HTML page. The text pre-processing step for HTML 

10 translation is described in more detail further below. As this pre-processing step 
distills the input, it retains information about the input's original state that is later 
restored to the translation in Step 6, as the translated output is produced. 

Figure 20 illustrates text pre-processing. Text pre-processing can remove 
white space, remove and retain capitalization information, remove and retain 

15 punctuation information, and rewrite contractions. 

In step 2 the input is analyzed for special linguistic structures such as 
synonymous words, extraneous expressions and common phrases. This is also the 
level where the System will examine the input for any potential ambiguities that 
the user would need to resolve. In cases such as translated search, where the 

20 System employs a feedback to the user mechanism, any such ambiguities would 
be resolved at this level before the input passes to the next step. After performing 
the linguistic analysis and determining what phrases need to be translated, the 
language pre-processor determines what calls to make to step 4, as well as which 
of those calls to make to the static translation cache or to the translation engines, 

25 as described below. 

Figure 21 is a continuation of Figure 20 illustrating the language pre- 
processing. This includes removing extraneous expressions (such as "well" or 
"so"), rewriting slang, rewriting abbreviations, and dissecting compound phrases 
into analyzable units. 

30 The text pre-processing and language pre-processing described in Figures 

20 and 21 includes (but is not limited to) the following specific steps: 
• Remove capitalization, preserving the information; 

Attorney Docket No. 24 1 76-703 23 
H:\private\wpdocs\pd\amikai\703app42.doc 



• Remove punctuation, preserving the information; 

• Standardize spacing; 

• Remove commas, hyphens, and other sentence-internal punctuation for 
cache matching; 

• Parse out names of other users from beginning or end of sentences; 

• Parse out connecting words and expressions such as "well," "oh," "ah," 
"well then" (English), "pues," "si" (Spanish), "eh bien" (French), " ," " " 
(Japanese), etc.; 

• Attempt to chop sentences at commas and semicolons to match each half 
with cache; 

• Attempt to correct common spelling errors for cache matching; 

• Expand contractions, preserving data (English, German, French); 

• Rewrite abbreviations for cache matching, e.g. 

o "r" = "are", "4" - "for" (English) 
o "2" = "de", "9" = "neuf ' (French) 
o "k" = "que", "t" = "te" (Spanish) 

• Add dropped from , (Japanese); 

• Chop off ending particles ("gobi") (Japanese); 

• Expand to , (Japanese); 

• Attempt to drop accents for cache matching (French, Spanish); 
Use alternate spelling for cache matching (German). 

Step 3 is where the lowest-level translation occurs. Depending upon the 
content of the text input, the translation step employs one of two subsystems: 

Static Translati on Cache 
In this embodiment, a translation cache stores commonly submitted inputs 
and their translations for extremely fast lookup. The motivations for the static 
translation cache are, translation quality, speed and scalability. By utilizing a 
cache, the present invention is able to specify perfect translations for a large 
number of the most commonly submitted text inputs. Such inputs include 
colloquialisms, slang and common phrases in each language, as well as specialized 
phrases that are common in specific client applications and industries. By 
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accumulating commonly accessed inputs and their translations in a high-speed 
cache, the present invention increases the speed of common translations and 
thereby improves the user experience. By minimizing the number of calls that the 
Systemof the present invention makes to the translation engines, scalability and 
stability are improvided. 

The cache functions by grouping phrases that have similar meanings and 
then associates a single canonical phrase with each group. When performing a 
translation on any of those phrases, the cache returns the translation of that 
canonical phrase. The cache includes a database table of canonical phrases across 
all supported languages and a series of hashtables for each supported language. 
These are canonical phrases that have a version in every supported language. For 
example, the expression "Hello" is universal and has a version in all languages. 
Each hashtable stores phrases that may not have exact equivalents in the other 
languages, but can be approximated to one of the canonical phrases in the first 
table. The key of the hashtable is the common phrase, and the value is an index to 
the row in the first database table with the equivalent canonical phrase. Because 
the text has been pre-processed and distilled prior to handling by the cache, the 
lookup is not disturbed by minor textual differences in the input such as extra 
spaces or inadvertent punctuation. 

In chat applications a large number of common phrases, including but not 
limited to greetings, frequently repeated phrases, and chat lingo are stored in a 
table that lists translations for each of the phrases. This ensures fast, completely 
accurate translations for the most common phrases which people use in the chat 
environment. The phrases stored in this master table are called canonical forms. 
In addition, variants of each of these phrases in each language so that these will be 
recognized as well. These variants include contracted versions, versions with 
extraneous, non-content-bearing words ("Hello" vs. "Hello there"), and 
synonymous expressions ("I'm well" vs. "I'm fine."). The variants are stored in a 
standardized format called a key with all letters are downcased, punctuation 
removed, and spelling standardized to ensure that the widest range of user input 
will be recognized. 
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Figure 17 illustrates the common phrase table with the static translation 
cache. The processed input is received, and then converted to a key form by 
removing particulars of language usage. (This process includes many of the steps 
described in Figures 20 and 21.) There is a look up in the key table to see if the 
5 input is a common phrase. The key table then gives a reference into the canonical 
phrase table, which gives the output translation of the input in the appropriate 
target language. Punctuation, capitalization information, and the like are then 
restored. 

Third Party Translation Engines with the System of The Present Invention 

10 For text inputs that are not stored in the static cache, the System of the 

present invention sends the input to an appropriate third-party translation engine 
for processing. The present invention utilizes several translation engines to ensure 
that the quality of translation is optimal for each supported language-pair and 
treats each third-party engine as a virtual black-box. Different engines have 

15 different capabilities. A custom Java wrapper is written to each engine, which 
serves as a common API so that previous steps do not have to understand or 
interact with each engine's unique API. Each engine instance handles a single 
language pair and produces for each text input one translated output. 

Each third-party translation engine is treated as a distributed object and 

20 communicated by using the RMI protocol. The System of the present invention 
utilizes multiple instances of each translation engine running on numerous 
machines to minimize dependence upon the stability of any one single engine 
instance or machine. Further, the System of the present invention can be scaled 
simply by adding additional machines and connecting them to the distribution 

25 step, described previously. 

Because of the ambiguity of language, the quality of an MT engine's 
output is highly dependent upon the quality and relevance of the lexica it utilizes. 
To improve the quality of output lexica for each language is compiled for 
numerous topic areas, and the appropriate topical lexica is applied to each 

30 communication domain. For example, in its business chat rooms, the translation 
engines employ business-related lexica, whereas its sports rooms use sports- 
related lexica. 
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Figure 22 illustrates the translation engines with proprietary dictionaries 
of different types. These include topic specific dictionaries appropriate for the 
topic of the chat room, website, or other current application, proper name and 
proper noun lists that the method and system of the present invention frequently 
update to make the translations in each application as current as possible. Users 
are also able to make their own dictionaries. 

In step 4 the various fragments of the original input are reassembled after 
having been split apart in step 2. Some parts of the input may have passed through 
a translation engine, while others were routed to the static translation cache. 

Figure 18 illustrates the language post-processing (restoration) stages 
where separate units are constructed and text is reconstructed. This is includes 
restoring certain abbreviations and reconstructed units that were separated during 
the language pre-processing in Figure 21. 

Step 5 restores the textual changes that were made to the input in step 2. 

In Figure 19, text post-processing (restoration) occurs with punctuation, 
contractions, and capitalization restored as appropriate. This step generally 
restores the information extracted in Figure 20. The text is then prepared for 
display and output. 

The translation layer includes customized dictionaries. One type of 
customized dictionary is topic specific where the topic is specified either 
automatically by the topic of the chat room, or manually by the user as he/she uses 
the browser or search engine as illustrated in Figure 23. These topic-specific 
dictionaries include ones provided by the translation engine itself. 

The dictionaries of the present invention improve the level of translation 
for specialized topics. Additionally, for general translation language and topics of 
the dictionaries are maintained topical and current. 

The present invention updates a dictionary of proper nouns with their 
correct translations or transliterations for all necessary language pairs. In the chat 
setting, this allows discussion of the most current topics. In the browser and 
search engines users will are able to keep up with the fast-moving nature of web 
sites on the Internet. 
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The translation layer also includes user specific dictionaries. Users are 
also encouraged to assemble personalized lexica. These allow users that use 
specialized language not handled well by general dictionaries to specify the 
desired translations. Users that have familiarity with a language other than their 
5 own can build this dictionary directly. If a user lacks this ability, the present 
invention invention provides a tool for speakers of two different languages to 
specify jointly the proper translation of a term for the dictionary. In addition, a 
second feature allows a person to store a word in his/her personal lexicon, notify a 
professional translator, and have the correct translation of the expression added to 
10 in his/her dictionary at some point in the next day or two. 

The System of the present provides a filter that scans all input to the chat 
application for slang, idioms, chat lingo and problematic constructions. The filter 
expands or rewrites these specialized phrases to expressions in a form that can be 
better translated by the translation engine. The filters can be constantly updated to 
15 keep up with current slang and chat language 

The present invention provides feedback to enable the users of the System 
to judge and respond to the quality of the translated output. There are four types 
of feedback produced and utilized by the System. These are illustrated in Figure 
24. 

20 Figure 25 shows the many levels of feedback which are incorporated in 

the translation System of the present invention. The System incorporates different 

type of feedback to improve the quality and usability of the translations. 

User-User Feedback: The "Help?" button provides a mechanism for other 

users to say whether an input was translated in an understandable fashion. This is 
25 especially important for monolingual users who otherwise have no way of 

knowing whether their inputs are being translated correctly. 

System-User Feedback: The System incorporates warnings, suggestions, 

and both static and interactive tutorials to actively educate users to use the 

translation engines as productively as possible. 
30 User-System Feedback (direct): Users are able to direct the translations 

through a number of means, such as "do not translate" lists, "do not translate" 

markers in the input line, and user-defined dictionaries. 
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User-System Feedback (indirect): User activity directs modifications that 
the developers make to the translation System. For examples, users will report 
poorly translated words and phrases, and developers will also monitor user- 
defined dictionaries and "do not translate" lists to find items to add to the System 
dictionaries. 

This feedback is important because many people are unfamiliar with 
machine translation (MT), and so the different feedback cycles help to educate 
users about the strengths and limitations of MT. In addition, the feedback 
provides mechanisms for users to control and personalize the performance of the 
MT engines, so this gives them a greater sense of control and allows them to view 
MT as a useful tool instead of something mysterious. 

Figure 26 illustrates the way the System educates the user about the MT 
engine, as well as about his or her own language. People believe that they are 
experts on their native languages, however most people have limited knowledge 
about their native language and how it works. The tutorial informs users about the 
elements in their language which are likely to be ambiguous or difficult to 
translate, such as slang expressions, idioms, and certain words and constructions 
("got", "se", etc.) Interacting with the MT engine itself also shows users who 
understand at least some of the target language the strengths and limitations of the 
System, and helps educate them about the most productive use of the translation 
engines. 

User feedback to other users is from the recipient of the translated output 
to the original sender. For example, a user who receives an incomprehensible 
message can tell the original sender that he or she did not understand the message. 
This immediately prompts the sender to rephrase the message in a form that can be 
more easily translated. A person receiving an instant message from a colleague, 
realizes that part of the translated message is not very clear. The receiver can 
immediately prompt the sender to rephrase the difficult part of the original 
message. 

User feedback to the translation system is feedback that the recipient of 
the translated output gives to the translation System. Over time, as large amounts 
of translation data are accumulated, the present invention can use this data to 
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improve the quality of the translation System. This can occur manually or 
automatically with the System. System feedback to the user occurs in a negotiated 
translation when the System and the user together attempt to resolve ambiguities 
in a translation. System feedback is especially critical when text entries are short 
as in search queries. When a user enters a query in one language whose meaning 
is ambiguous, the System can respond by prompting the user to select from a list 
of ambiguity resolving options. Without this type of feedback highly accurate 
translated search queries are not possible. 

Text-processing step for HTML page translation is different from plain 
text translation. The present invention parses HTML pages, and provides 
placement of translations in the HTML page. There are two options for HTML 
page translation, show both original translation and show only the translation. 
When the original and translation are both shown the translations are preferably 
inserted into the original page without disrupting the form of the page. Tthe 
System then parses the HTML page and finds key markers which delineate 
appropriate locations for inserting translations. When only the translation is 
shown, the original text is replaced entirely by the translations. 

igure 1 5(b) is similar to Figure 1 5(a) with the same data path, but is for 
non-interactive applications such as the browser or auction tool. Wireless 
communication would also fall under this same grouping. 

Figure 27 describes the browsing tool on a high level as a three-step 
process: 1) the user makes a page request, 2) the request undergoes processing by 
our System, and 3) the System returns the response page to the user. 

A user request consists of a URL and a language pair— source language 
and target language. The source language is the original language of the page, and 
the target language is the language that the page is translated into. The source 
language may become optional as a language identifier is incorporated into the 
browsing tool. The request may also include cookies previously set by the web 
site associated with the page request and other parameters, including but not 
limited to form parameters which can be forwarded on with the request. 

Figure 28 is a high-level blowup of step 2 from Figure 27. It describes the 
overall processing which occurs between the users' page request and the page 
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response. Three steps are included: extract parameters from the user request, 
perform page retrieval and processing, and return the processed page within a 
dynamically-generated page. 

Figure 29 provides more detail of step 2 from Figure 28 for page retrieval 
and processing. There are three main types of page requests. The first is a user- 
specific page: These pages are never cached because it is assumed that their 
content is changing too often for them to be effectively cached. An example of 
such a page is a user profile page. They are always newly retrieved and rewritten 
on each new request. The second type is a non-user-specific page that has been 
cached. If a page is not user-specific and has already been cached, it is pulled 
from the cache and returned. The third type is a non-user-specific page that has 
not been cached or whose cache entry is out-of-date. These pages are newly 
retrieved and rewritten. In addition, they are stored in the cache for future queries. 

Figure 30 is a blowup of page retrieval as represented in Figure 29 In 
order to fulfill a user's page request the browsing tool of the present invention 
must first request the page from the source web site. In order to do this, it must 
first extract necessary information from the user's request, create a new second 
request, and then utilize this second request to query the source site. The page 
retrieval process consists of five steps: 

1) Add parameters to URL. Here, any parameters contained in the user's 
page request are added to the URL of the second request. 

2) Handle outgoing cookies: Cookies contained in the user's page request are 
forwarded to the second request. 

3) Perform HTTP request on new URL: This is where the source site is 
queried. 

4) Retrieve page: Using the appropriate character encoding for the page 
(based upon its language), the page is retrieved. 

5) Handle incoming cookies 

Figure 3 1 is a blowup of step 1 from Figure 30 describing how parameters 
are added to a URL before querying the source site. An important thing to note is 
that this process is language-sensitive. Specifically, when a user is viewing pages 
with source language A in target language B, parameters which represent user 
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inputs are translated from language B to language A before being added to the 
page request. This enables the users to actively interact with pages that are not in 
their own language. For example, if a user is viewing an auction site whose 
source is language A in language B, and the user wishes to enter a search query in 
language B for a particular object, that search query is translated into language A 
before being submitted to the page. 

Figure 32 is a blowup of step 2 from Figure 30. Cookies which are passed 
in as part of the user request are rewritten to drop the path prefix of the present 
invention, thereby restoring the original path of the cookie, and the browsing tool 
includes the cookie when querying the original source site. 

Figure 33 is directed to page rewriting and illustrates why the browsing 
tool of the present invention is unique. The browsing tool kit enables the user to 
insert the translations inline, in order to view both the original text and the 
translation simultaneously. Additionally, the browsing tool preserves the look and 
feel of the original page. This is accomplished by carefully positioning the 
insertion of translations at strategic locations within the page so that they do not 
significantly shift or displace original content. While the page is made inherently 
longer, its overall look and feel is not disrupted. 

Referring now to Figure 34 page rewriting is a two-pass process. In the 
first pass the page is traversed and translation placeholders are inserted in places 
where translations should be later added. Simultaneously, a list of text strings 
which must be translated is extracted. In the second pass the text strings which 
have at this point been translated are now inserted into the page, replacing the 
placeholders. Figure 35 is a graphical illustration of the page rewriting process 
described in Figure 34 

Figure 36 is a blowup of pass 1 from Figure 37. In pass 1, the HTML 
page is traversed and HTML elements are encountered. Each HTML element is 
handled uniquely. Certain HTML elements represent textual elements which 
require translation, while other elements contains links which must be rewritten. 
Still other elements require other handling. Figures 37-39 give examples of how 
certain elements are handled in pass 1. 
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Figure 37 illustrates handling of normal text. Normal text is defined as 
text positioned outside of any HTML tags. Normal text is handled in two ways: 
1) It is copied to the rewritten page, and 2) It is added to a cumulative text buffer 
to be later translated and inserted into the rewritten page. Both steps are required 
because the browsing tool displays both the original text and the translated text in 
the page. So the first step preserves the content and location of the original text 
piece. The second step causes the text to be translated and inserted into the page. 

It is important to note that in the second handling step the text piece is 
added to a buffer and later translated and inserted into the page, rather than 
immediately translated and transferred to the page. In order for the translations to 
be inserted into the page in a way that does not disrupt the page's original look 
and feel, they must be strategically positioned. This requires that each text piece 
not be immediately translated and inserted following its original text counterpart, 
but rather that translations be grouped together and later inserted into the HTML 
page in an appropriate location. This results in a more coherent page and a better 
user experience. 

Figure 38 illustrates handling of JavaScript. When a JavaScript block is 
encountered: 1) it is scanned for text strings which require translation, and these 
strings are replaced with placeholders in the JavaScript; 2) these strings are 
translated; 3) the placeholders are replaced by the newly translated strings; and 4) 
the new JavaScript block is copied over to the rewritten page. Unlike normal 
HTML where the original text and the translated text are conveyed, JavaScript 
blocks only convey translations, since most JavaScript text strings represent single 
text elements which can only have a single value. 

Figure 39 illustrates handling of the translation identifier. Translation 
identifiers are HTML tags that signify the end of a contiguous chunk of text, 
representing a position where a translation of previous text should be inserted. 
When a translation identifier is encountered, the contents of the text buffer 
(described above for Figure 37 and composed of text strings which are 
encountered within the page) are sent off for translation, a placeholder is added to 
the rewritten page and the text buffer is cleared. 
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Figure 40 describes instances in the page where the browsing tool rewrites 
URLs in the page. When URLs representing textual content are encountered, they 
are rewritten through the server of the present invention. URL rewriting ensures 
that as the user clicks through to subsequent pages, these pages continue to be 
translated as well. This provides a seamless user experience, allowing the user to 
browse and translate the web freely without any intermediate steps. This figure 
denotes specific cases where URLs are written to pass through the current 
invention. It is important to note that URLs are only rewritten if they represent 
textual content. URLs which represent other types of content, such as binary 
objects (images), should not be rewritten and should reflect the original source 
location. 

The content of a URL is rewritten through the servers of the present 
invention., by changing the URL to pass through serves of the System of the 
present invention, and the original source location becomes a parameter which is 
passed to the servers. This parameter denotes the page which the user is 
requesting. 

A relative URL is one in which the domain of the source location is not 
specified. With the present invention, the source's domain is added as a prefix to 
the URL, and then the URL is written as described in the preceding paragraph. 

Figure 41 is a blowup from Figure 35 for text translation. Figure 41 
illustrates how text is translated as part of the browsing tool. Multiple individual 
text strings which have been encountered during the page traversal process are 
concatenated. A single concatenated string is passed to the translation engine, 
which returns a single translated string. This single translation string is broken 
back up into multiple translations and returned. This approach enables all the text 
on a page to be translated using a single call to the translation layer. 

Figure 42 is a blowup of the final stage from Figure 30 for handling 
incoming cookies. This is the reverse of the process described in Figure 32. 
Cookies which are returned from the queried site are rewritten so that their path 
passes through the servers of the present invention and are then inserted into the 
page response and returned to the user. This ensures that the cookies will be 
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resent to the site whenever the user utilizes the browsing tool to access the same 
site in the future. 

The HTML page translation process of the present invention performs the 
following steps: 

5 (i) Performs text pre-processing on the HTML page, parsing the HTML 

page and producing a collection of text strings that should be translated. 

(ii) Performs language pre-processing on each of these text strings. The 
language pre-processor determines what, if any, textual elements within each of 
these strings needs to be sent on for translation. For each of these to-be-translated 

1 0 strings, it separates them into two groups, "a" those that should be handled by the 
third-party translation engines, and "b" those which should be handled by the 
static translation cache. For those in group a, the language pre-processor 
concatenates all of these to-be-translated elements into a single demarcated string 
to the third-party engine in step 4. By concatenating all of these strings into one, it 

15 limits the number of calls to the third-party engine for each HTML translation. 
For those in group b, the language pre-processor makes individual calls to the 
translation cache for each textual element. 

Upon receiving all translations from step 4, language post-processing 
occurs where the proper outputs are reconstructed. Text post-processing in step 6 

20 then reconstructs the HTML page, inserting translations in the appropriate 
locations and thereby preserving its original form. 

The System properly can handlesHTML pages with Javascript, Forms, and 
Cookies. The resulting page then operates as the original with no change in 
functionality. Additionally, the System can use optical character recognition 

25 technology to recognize the textual content of images, and provide a translation 
for text embedded in images as well as pure HTML text. 

In one embodiment, the framework is written primarily in Java, making it 
compatible with existing software applications and legacy systems. Because the 
framework is Java-based, it can run on any platform. The present invetnion can 

30 operate on Linux Pentium machines. The third-party translation engine can 
operate on a Unix, Linux or Windows NT platform on distributed machines. 
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A "Help" butoom can also be provided. With the present invention, the 
use of translation within a chat environment is provides users to give feedback 
about the understandability of a statement's translation. This feedback takes the 
form of a button connected with each posted message called the Help button 
5 which other users can click to indicate an unclear translation , see Figure 43.. 

After the Help button has been clicked, the user that made the original statement is 
notified of the bad translation and shown a screen with an editable copy of the 
statement as illustrated in Figure 44. This gives the user an opportunity to modify 
the statement to something more understandable for the System. In addition, the 
1 0 mistranslated statement can be sent through a grammar checker which can scan 
the input for a number of possible problems, including unparsable grammar, 
misspelled words, difficult-to-translate words or constructions, ambiguous words 
and the like. 

One of the problems with most machine translation systems is that there is 

15 no incorporation of feedback to guide the translation. At numerous stages in 
analysis there are decisions that must be made, often about the resolution of 
ambiguities. For example, there can be ambiguities that are lexical, syntactic, 
semantic and pragmatic. A user can also confuse a translation system by using, (i) 
a word not in the lexicon, (ii) a construction that is not in the grammar and (iii) a 

20 known word in a novel way or with a novel meaning. Confusion can be 
unintentional and caused by misspelled words, poor grammar; incorrect 
punctuation and incorrect characters, particualarly in Japanese and Chinese. 

For these ambiguities and confusing constructions, the Help button 
provides information about misunderstood sentences to the user and closes the 

25 feedback loop. This is done when another user does not understood the 

translation. Thus, the System of the present invention removes the burden of 
detecting the need for clarification from the computer. Furthermore, the System 
takes advantage of the communal nature of the chat room to allow users to help 
each other to find the best language for translation. 

30 The Help button can refer to either the full user comment or to a phrase or 

word within the comment. In the case where just a single word or phrase is 
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mistranslated, the other users can specify the specific part of the input which was 
confusing. 

As the user browses the web the translated browser of the present 
invention can automatically provide translation sites without requiring the user to 
5 specify the source language of each site. This is done by implementing a language 
identifier that scans a page, guesses the language and then executes the proper 
translation automatically. In the case where the identifier guesses wrong, or the 
page contains multiple languages, the user can override this feature, By removing 
any necessity for the user to worry about, or even be aware of, the source language 

10 of the materials he/she is looking at, the present invention makes the browsing 
experience as seamless as possible. 

The present invention can also provide a translation helper for the user. 
The translation helper is an interactive process with many functions including, 
instructing users on the proper use of the translation engine, helps users determine 

15 the best phrasing in order to achieve high-quality translation, and adjusts user 

expectations about the capabilities and limitations of machine translation. Users 
are trained to avoid these problematic constructions. This is done through both a 
passive approach, which attempts to provide instruction and information to the 
user, and an active element that reacts to user input to guide the user to better 

20 phrasing for translation. 

Each user is encouraged to read a list of suggestions and instructions for 
the best language and constructions to use to produce the best translations. There 
is a separate list for each of the languages. The present invention offers a number 
of formats for the information which allow the user to choose the level of detail 

25 and the conciseness of presentation which best suits his/her tastes. The formats 
the user can choose from are: 

1) A quick, bulleted list of points. This gives the basic information in an 
easy-to-read quick-reference format. 

2) A longer README- style file. This format gives an expanded form of 
30 the information with longer explanations and good and bad examples to illustrate 

each point. The mascot is featured in amusing cartoons to make each point more 
memorable. 
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3) An interactive tutorial. In this version, the the present invention guides 
the user interactively through a number of examples to communicate the 
information in a fun, memorable way as shown in Figure 45. The interaction 
includes illustrations, small quizzes, good and bad examples, and areas to test 
5 examples with the translation engine. 

Figure 46 is a blow-up of the different types of the tutorials, such as a 
quick, bulleted list of points, a more thorough tutorial with good and bad 
examples, illustrations, and explanations, or an interactive tutorial with quizzes, 
games, and translation test areas. The more elaborate tutorials make the learning 
10 experience more memorable and fun. The method and apparatus of the present 
invention provides a level of interactivity with the user to assist the learning 
process. 

The present invention also provide tutorial daemons that are programs 
which run in the background and monitor the users' inputs. By monitoring a 

1 5 user's typing before the sentence is sent to the translation engine, the present 

invention helps to guide the user toward sentences that are more easily translated 
and warn them of dangerous inputs. When a problem is detected, it is marked in 
the text within the input box and a "warning light" comes up in an area of the 
screen dedicated to tutorial messages as illustrated in Figure 47. The tutorial 

20 daemon can includes a spell checker, a grammar checker, a difficult phrase flagger 
and an input length meter. 

Figure 48 shows how users' expectations and knowledge about the System 
are influenced through actual use of the System. For the chat application, as the 
user provides input the input goes through a tutorial daemon. The tutorial daemon 

25 is a second-level tutorial which runs in the background and gives feedback about 
the user's input. The tutorial daemon flags difficult words and phrases, 
troublesome constructions, spelling and punctuation errors, likely accent errors, 
troublesome zero-anaphora, unlikely part-of- speech sequences, and other possible 
sources of translation errors, in order to train the user. Further detail on the 

30 tutorial daemon is given in Figure 49. In addition, the user receives feedback from 
seeing the translations that come through and other users give feedback in the 
form of the "Help?" button. 
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Figure 49 illustrates the tutorial daemon, which provides a number of 
checking stages to provide feedback to the user. Before the user hits the enter 
button the tutorial daemon provides a warning of things to watch out for. The 
daemon includes (among other elements) a grammar checker, a spelling checker, a 
5 difficult-phrase detector, an input-length meter (to warn users about overly-long 
inputs), an ambiguity detector, and an ambiguity resolver, which uses local 
context to determine the meaning of ambiguous words and phrases. 

The spell checkers reports each word that does not appear in one of the 
active lexicons. It does this by checking the current input line at short intervals 
10 before the return key is hit, and marking, either by highlighting, underlining, or 

some other graphical notation, that an unknown word has been found. This allows 
users to filter out spelling errors, non-standard words and slang, as well as 
problematic proper nouns before they are sent to the translation engine. When a 
user right clicks on a questionable word, a list of suggested alternatives is 
15 presented to speed correction. 

The grammar checker checks grammer such as how punctuation is used. 
It also attaches part-of-speech tags to each word and checks to see if any unlikely 
tag sequences are detected. A questionable sentence or phrase is highlighted to 
notify the user that the user should rephrase the input if possible. Right clicking 
20 on the questionable phrase brings up an explanation of the problem and a possible 
suggestion for a fix. Examples of the grammer checker include checking to make 
sure every sentence in Japanese has a subject and verb, and if question words have 
the proper accent marks as in Spanish. 

Aa number of languages have words and phrases that are not 
25 grammatically incorrect but are difficult to translate. Examples of such 

difficulties include"no" and "suki" in Japanese, the impersonal passive with "se" 
in Spanish; "got" in English, "marche" in French.. The difficult phrase flagger of 
the present invention highlights these to encourage the user to rephrase the 
sentence for better translation. A right click on the problematic expression brings 
30 up an explanation and a list of preferable rewordings. 

An input length meter is also provided with the present invention. 
Because translation quality declines with longer sentences it is important to keep 
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input as short as possible. As a constant reminder of this, a small input-length 
meter is displayed next to the input text box in the chat application as illustrated in 
Figure 50. The input box is periodically checked to see how many words, or in 
Asian languages, how many characters, have been entered, and increases the meter 
5 reading accordingly. Certain words, such as conjunctions, push the meter's needle 
up even further. After a certain word count, the meter enters a red "Danger Zone" 
which warns the user that their input is much more likely to be mistranslated. The 
Danger Zone level depends on the language and engine being used. 

In contrast, in the context of Search Application of the present invention, 

10 inputs which are too short are the problem. In this setting, a daemon watches the 
queries a user inputs, and issues a suggestion if a number of one- or two-word 
queries are entered in succession. In cross-language search a major obstacle is the 
difficulty in translating the exact meaning of the search terms. The context of a 
number of search terms aids significantly in determining the exact meaning of the 

15 query words. The average number of words per query as reported in most studies 
is usually around two, so without encouragement most users will tend to enter 
these short queries and will probably become discouraged by the poor search 
results. 

In both the chat and search applications the present invention checks the 
20 input for potentially ambiguous words and phrases. These ambiguous expressions 
are highlighted to encourage the user to rephrase the input for a clearer translation. 
Without this feedback, the user will often have no idea why a sentence or query 
produced such a bad translation. The ambiguous words can be detected by 
consulting a specialized word-translation dictionary which lists specific alternate 
25 translations for a word. Ambiguous phrases can be detected either by scanning for 
specific phrases (such as a "yes" or "no" following a negative question) or by 
executing a part-of-speech tagging and seeing if there are multiple tag sequences 
judged likely. 

Once an ambiguity has been detected, an ambiguity resolution program 
30 can be triggered, either with a right click in chat, or automatically in search. The 
resolver can either consult surrounding context or other search terms to determine 
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the most likely sense of the ambiguous word, or it can spawn a dialogue box to ask 
the user for clarification directly. 

In order to track translation problems and provide feedback for 
refinements to the help files, lexicons, and tutorial daemons, the method and 
5 apparatus of the present invention logs every word which passes through the 
translation engine untranslated and also logs every input which receives Help 
button feedback from another user. These logs permit immediate recognition of 
any patterns in mistranslation which occur, including words missing from the 
lexicon, constructions not covered in the translation System's grammar, and 

10 frequent grammatical and spelling errors. 

The present invention provides a number of aids and shortcuts to help 
users enter their input quickly and correctly. These include an iconic entry which 
provides a shortcut for input in the chat application. A user clicks on a series of 
special icons which immediately insert certain set phrases into his/her text entry 

15 box. These icons take three different forms serving different purposes. 

Figure 16 illustrates the wide variety of input aids incorporated into the 
System. These include typing short cuts (either by keyboard or mousing on a 
separate menu), emotions, a hyperlinked dictionary, buttons which introduce long 
phrases into the chat or other application, special characters which set apart text 

20 which is not to be translated, lists of words and phrases which are not to be 
translated, and automatic recognition of URLs in the text. 

The present invention also provides typing shortcuts. The chat 
environment requires fast input and quick reaction to maintain a fun and 
interesting level of interaction. However, this pressure to increase input speed 

25 also encourages the user to cut corners which greatly harm the quality of 

translation. These cut-corners include abbreviating frequently repeated words, 
using pronouns, and leaving out subjects or verbs entirely, especially in Japanese. 
In order to facilitate fast input while discouraging these bad habits, the present 
invention shows the user a small window with a number of phrases which can 

30 immediately be entered into the text input line with a single click of the mouse as 
illustrated in Figure 5 1 . The phrases can also be accessed with keyboard shortcuts 
to make input even faster and simpler. 
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The present invention also provides a number of emoticons and 
illustrations that users can include in their messages, such as a smiley face and a 
heart that is illustrated in Figure 52. These are transparent to the translation 
engine and thus will have no effect on the translation quality. However, they have 
5 a substantial effect on the user-friendliness of the System and the total ability of 
the users to communicate and connect with each other. 

Action buttons are provided to enable a user to select from a menu of 
buttons which print out full sentences describing the user's attitude or actions. 
These range from the straightforward ("[User A] scratches his head.") to the cute 

1 0 ("[User A] blows [User B] a kiss! ") to the silly ("[User A] dances the Macarena."). 
Each action phrase is stored in each translated form, and is displayed to each user 
in the appropriate language. 

Special characters are designated which signal the translation engine of 
the System of the present invention not to translate part of the input. The user 

1 5 simply surrounds the text not to be translated with these special characters and the 
translation The System of the present invention ignores that section of the input 
and sends it through verbatim. These are important when entering names which 
are also common nouns (e.g. Nick, Young, the Giants, Los Angeles), when 
entering titles which the user does not want translated, and when users are 

20 discussing actual language use and language learning and need to mention specific 
examples. 

In addition to the special "do not translate" characters, users can construct 
a personal list of words and expressions which are not to be translated. With such 
a list, a user can record names and titles which he/she mentions frequently, 

25 removing the need to annotate them each time with the special characters. 

With the present invention, hyperlink dictionaries are provided and permit 
a user to immediately bring up the dictionary definition for any word by right 
clicking on that word. This is important for users because many of them will be 
language learners or people interested in other cultures and they will want the 

30 ability to see immediately the meaning of new words they encounter. In addition, 
once a user has typed an input and seen it go out in its translated for to the chat 
room, the user might feel some concern that the intended meaning of the sentence 
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was preserved in the translation. One way to reassure the user and give him/her 
the power to make sure the translation is correct is to make available the 
dictionary definitions of the translated words. 

The dictionary definitions shown can either be the literal dictionary entries 
in their entirety, or a check of local context can be used to determine which 
particular sense of the word is correct for the sentence. 

As a user enters a URL into the flow of chat, it is immediately recognized 
as such and is transformed into a hot link. This feature encourages users to trade 
links and information and will facilitate communication in the chat environment. 

In order to help users produce the best translations possible for their 
particular interests, the present invention gathers a number of personalizing 
features into one area of the web site called "My Translator" and illustrated in 
Figure 53. Users are encouraged to customize these features to their own 
particular needs. Not only does this produce better translations and greater user 
satisfaction it also encourages a sense of ownership in the translation technologies 
and will encourage repeated visits to the web site. The My Translator area 
includes the following: 

1) User-Built Custom Dictionary: The user can collect and store words, 
phrases, and names which they frequently discuss, search for, or see web pages 
about. 

2) User-Built "Do Not Translate" List: Names, titles, and phrases which 
the user usually does not want to be sent through the translation engine are 
collected in this list. 

The present invention permits chatters to set their keyboard shortcuts to 
enter certain words and phrases automatically. Frequent chatters will appreciate 
being able to store these shortcuts from session to session. 

If the user prefers to keep one of the subject- specific lexicons provided by 
the present invention as the default lexicon to use in web searches, web browsing, 
or chatting, this can be indicated in a "My Translator" area. Additionally, a 
general-purpose space is provided to the user to jot down notes while using the 
web browser, search engine, or chat rooms. 
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In Figure 54, a number of personalization features are unified and 
presented in one section for the user's convenience. Each user is able to have their 
area where they keep their personalized dictionary, a personal "do not translate" 
list, personally chosen default lexicon selection, personally defined keyboard- 
shortcuts, and a notepad. This effects how all the other parts of the System, such 
as the browser, the auction translator, the translator for wireless, translated chat, 
and all other translation-based applications will work. It summarizes the 
information in one area, so a user can have control and can improve the quality of 
the translation engine performance for his or her own particular uses. 

Referring now to Figure 55, the user starts on the splash page, which just 
has the company logo, and a choice of languages. They choose a language, so that 
they can go into the website and have it be written in the language that they speak. 
From there on, the rest of the pages are translated into all of the languages that we 
offer. The next page that they see after the Splash page is the Welcome page and 
then from that page, if they are a returning user they can log right in, go to the chat 
rooms page, choose a chat room and start chatting. If they are new users, there are 
three main options. Ideally they would go to the sign up form, fill all of that out, 
then go to the tutorial, learn how to use the chat and then go to the chat rooms 
page. If they are not convinced that they should sign up on the Welcome page, 
then they can go to the tour, find out more about it, and then go to the sign up 
form. There are also a lot of other pages on the site that anybody can access. The 
web site of the present invention is built with different language zones. Initially a 
user comes in, selects a language. The user can change the viewing language of 
the site at any time. 

Figure 56 illustrates that there are a plurality of different features in the 
chat application. The user can have conversations with other users by exchanging 
translated messages in the chatroom. The user can also open a private chat 
window in order to have a one-on-one conversation. The user can switch to 
another chatroom. The user can view profiles of other users and see their gender, 
location, age, occupation, fluent languages, country of origin, and personal 
message. They can edit their own personal profile, as well. And they can access 
the help section which includes a tutorial, translation tips, support form, and FAQ. 
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Figure 57 illustrates that there are different features in the chat room. 
Examples include keyboard shortcuts for entering special characters, icon 
messages so users can send pictures (such as a smiley face) as a message. 
Bilingual users can switch the enter language control and enter in different 
5 languages. When the user moves the mouse over components of the chatroom, a 
description of that component appears in the mouseover tip box. Moderators have 
extra features, such as silencing other users or even eliminating their accounts if 
they are too disruptive. 

There is also a special interface for people who want to enter in double- 
10 byte text but do not have double-byte operating system. For example, usually 
when a user enters text in Japanese, they enter the text phonetically and then hit 
return to select the characters they want to represent the phonetics. In Internet 
Explorer, if a user who does not have a Japanese operating system wants to enter 
text in Japanese, a small HTML window will appear and when the user hits return 
1 5 to select the characters they want to represent the phonetics, those characters will 
be automatically sent to the chatroom as well. In Netscape, there needs to be an 
extra HTML window in addition to a window where the user selects the 
characters. We will hide this extra window, so when the user selects the 
characters, it will look like it s being sent directly to the chat window instead of 
20 being sent to the intermediary HTML window and then to the chat window. 

A "Do Not Translate" feature is also provided. This is utilized when the 
user is entering a phrase and wants to have a part of it not translated. For example, 
if they type in "Apple Computer," in the English-French chatroom. They do not 
want "Apple" to be translated into "pomme," the French word for "apple." Right 
25 now we have a feature where they user can place "o" characters around whatever 
they don't want to have translated. There are two ways to do this: they can use the 
Do Not Translate Button or type in the characters themselves. The Do Not 
Translate button is on the chat, and when they hit that button, the "o" characters 
are always automatically inserted around the cursor. So when they type, they are 
30 actually typing between the "o" characters already instead of having to go and 
put the special characters around it themselves. But once they learn that those 
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characters keep the phrase from being translated, they can just type them in 
themselves instead of using the button. 

The Help process can be used when somebody enters a message that a 
user doesn't understand. The user can let the other party know that the user 

5 doesn't understand. We will go into more detail about this in Figure 58. 

The Help process is illustrated n Figure 58. For example, user A enters a 
message with a typo. User B views the translation but doesn't understand it. User 
B can click the Help button that is on User A's message and right away it will put 
up a message that says, "User A, I didn't understand your message. Please 

1 0 rephrase it." Both that message and the message that was misunderstood become 
highlighted. User A can re-enter the message so it can be translated again. 

A keyboard shortcut is also provided: when the up arrow is pressed the 
previous messages appear in the text box where the user enters its message. 
Instead of retyping the entire message User A can hit the up arrow, fix the typo in 

15 the previous message, and send it again. This provides a fast way for people to be 
able to let each other know when there is any miscommunication. The 
highlighting makes the process clearer and faster as well. By highlighting the 
message it becomes much easier to spot the misunderstood message. 

Figure 59 illustrates the "current member's box". In the upper right area 

20 of the chat room is a list of all of the members currently in the chat room. 

Different actions can be taken with the different members in the chat room. For 
example, a "personal information window" provides information on how to find a 
person. "Private chat" brings up a new window where a user chat one-on-one with 
that person and the "ignore button" is used to ignore a user and stop seeing their 

25 messages. If none of the names are selected all of these buttons are disabled. If a 
user's own name is selected then the user can see their own profile and edit it. 
The other buttons are disabled. 

If a user's own name is selected the "personal information" button can be 
clicked so the user see its own information and the user can also edit its own 

30 information. Another button is provided on the "personal information" window 
which brings up another window where a user can edit its own profile information. 
If another member's name is selected, then all three of those features work. A user 
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can see its profile, the user can chat one-on-one with that person in another 
window or the user can gray them out and stop seeing their messages. 

Switching language zones is illustrated in Figure 60. For example, if a 
user is a viewing a website in French and decides to go to a chat room where 

5 another language is used, a window pops up that says "You are moving to a 

different language zone would you like to view it in English or Japanese?" The 
French user then selects the new language and from then on views the site or chat 
room in the new language. 

Figure 61 is an overview of the browsing tool of the present invention. 

1 0 The browsing tool is a frame and has various features, more fully described in 
Figure 62. The browsing tool is utilized when a user on one website enters the 
URL of a website he or she would like to translate. The user then goes to that new 
website with the translations. At the bottom of the window the user clicks on a 
link on the page and goes to that new page which is also translated. A user can 

15 also enter a new URL into the browsing tool and goes to that site translated. 

Figure 62 lists some of the browsing tool features. The browsing tool 
permits a user to change what language the site is being translating to, including 
"none". Additionally, the user can customize it and have its own favorite links, set 
up its own look and feel, toggle between showing and hiding the original 

20 language. A multi-lingual dictionary pop-up is also provided. 

The foregoing description of a preferred embodiment of the invention has 
been presented for purposes of illustration and description. It is not intended to be 
exhaustive or to limit the invention to the precise forms disclosed. Obviously, 
many modifications and variations will be apparent to practitioners skilled in this 

25 art. It is intended that the scope of the invention be defined by the following 
claims and their equivalents. 
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